Picture for Zheng Wei

Zheng Wei

KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving

Add code
May 13, 2026
Viaarxiv icon

FIS-DiT: Breaking the Few-Step Video Inference Barrier via Training-Free Frame Interleaved Sparsity

Add code
May 12, 2026
Viaarxiv icon

TACO: Efficient Communication Compression of Intermediate Tensors for Scalable Tensor-Parallel LLM Training

Add code
Apr 27, 2026
Viaarxiv icon

Hybrid Latent Reasoning with Decoupled Policy Optimization

Add code
Apr 22, 2026
Viaarxiv icon

WebForge: Breaking the Realism-Reproducibility-Scalability Trilemma in Browser Agent Benchmark

Add code
Apr 13, 2026
Viaarxiv icon

PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection

Add code
Apr 07, 2026
Viaarxiv icon

Efficient Document Parsing via Parallel Token Prediction

Add code
Mar 16, 2026
Viaarxiv icon

HMS-BERT: Hybrid Multi-Task Self-Training for Multilingual and Multi-Label Cyberbullying Detection

Add code
Mar 13, 2026
Viaarxiv icon

SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment

Add code
Feb 28, 2026
Viaarxiv icon

Yunque DeepResearch Technical Report

Add code
Jan 27, 2026
Viaarxiv icon